Skip to content

Data in Enterprise OMOP

Emory's OMOP Enterprise pipeline transforms clinical data from Epic and the Clinical Data Warehouse (CDW) into the OMOP Common Data Model. This section covers what's in the data, how it got there, and how we know it's right.

  • Data Mapping


    How source data flows from Epic and CDW into OMOP — the ELT pipeline, vocabulary mapping coverage, and custom concepts.

    Data Mapping

  • Data Quality


    Automated quality checks across 2,374 DQD tests (96.6% pass rate), 133 DBT tests, and a tracked list of known issues.

    Data Quality

  • Observed Conventions


    OHDSI community conventions, Emory-specific conventions, and documented adherence to standards across the pipeline.

    Observed Conventions

  • NLP Infrastructure


    Proposed span-based NLP schema extending the OMOP CDM — pipeline provenance, typed extractions, and _DERIVED tables for clean separation of NLP-derived data.

    NLP Infrastructure

  • Releases


    Version history from v0.2.0 through v1.0.0 — what changed, what was fixed, and what researchers should know.

    Releases

Data Mapping at a Glance

Area Pages
Pipeline Extract Load Transform (ELT) · Era Algorithms
Coverage Vocabulary Mapping Coverage
Extensions Custom Concepts · Requesting Mappings · Contributing Vocabularies